by Hsuan Huang
My personal understanding of the noindex, follow
meta tag for crawlers.
This article is a short personal reflection on basic SEO concepts. It covers the writer’s understanding of meta tags such as index
, noindex
, and follow
, and how they affect search engine behavior. It's intended for beginners or those just starting to explore SEO, while SEO experts may find it too elementary and can skip it.
A while ago at work, a colleague had a ticket about improving SEO. Although the scope and difficulty weren’t big, it piqued my curiosity. The main content was roughly :
On all /en-XX/ pages (eg., /en-AU/, /en-US/, /en-HK/, etc.), add: <meta name="robots" content="noindex, follow">
On all /ar-XX/ pages (e.g., /ar-SA/, /ar-AE/, etc.), add: <meta name="robots" content="noindex, follow">
To be honest, although I’m not the one handling it, I imagine it mainly involves determining which meta tags to add based on locales. The key point is understanding that these tags are specifically for search engine crawlers. Especially for e-commerce sites, SEO is a top priority.
I took a look, and I have three questions :
index
and noindex
?follow
mean?en
pages to be indexed, but not the en-XX
pages by search engines?By the way: en
stands for the language (English pages), and -XX
could be something like en-AU
, which represents the Australia region—so it’s an English page paired with the Australian dollar currency.
The first two questions are easy to understand :
The main question is the third one: indexing? link equity? I was curious too and did some research on why having meta tags on both language-only (en
) and language-region (en-US
) pages affects crawlers and, consequently, SEO rankings. Below is how I understand it:
Search engines may suspect you’re publishing duplicate content, which can lower the authority (ranking power) of both pages and negatively impact their rankings.
External links, clicks, and user interactions may be spread across multiple similar pages, preventing any single version from accumulating enough authority to rank well.
Google may be unsure which version to display, causing your main /en/ page to not consistently appear in search results.
That’s why we…
noindex
.Simply put: Although regional versions like the en-US pages are not indexed by Google, they still help redirect traffic and pass link equity to the main pages we want to rank.
Plain and simple summary : Not indexed = don’t include this webpage in search results, but it can still be crawled and visited.
The points above can really affect SEO rankings — this is actually the first time I learned this. If you’re also dealing with multilingual sites combined with regional versions, it’s worth considering how link equity influences crawlers and SEO!
References